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ABSTRACT 



An information management, retrieval and display system 
searches through an informational resource, such as a docu- 
ment (e.g., a treaty), a number of individual documents (e.g., 
Web pages resident on the Internet), or a stream of infor- 
mation (e.g., DNA code, source code, satellite data 
transmissions, etc.) and displays the results of the search in 
an collapsible/expandable format based upon a user-selected 
display criteria or hierarchy. Such a display hierarchy will 
allow the end-user to effectively and quickly obtain items of 
interest from the search results. Generally, the system per- 
forms a method for retrieving information from an informa- 
tional resource that includes the steps of: (a) dividing the 
informational resource into a plurality of finite elements; (b) 
assigning a categorical tag to each of the plurality of finite 
elements, where the categorical tag includes data pertaining 
to a content of the finite element; (c) generating a searchable 
database record for each of the plurality of finite elements, 
where each searchable database record includes at least one 
string contained within the finite element, where the string 
can be a word, a phrase, a symbol, a group of symbols, a data 
segment or the like; (d) supplying a search string; (e) 
searching the searchable database for searchable database 
records containing the search string; (f) arranging the results 
of the searching step in a hierarchal structure according, at 
least in part, to the data in the categorical tags assigned to the 
finite elements found in the searching step; and (g) display- 
ing the results of the searching step in the hierarchal struc- 
ture. 

32 Claims, 7 Drawing Sheets 
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INFORMATION MANAGEMENT, 
RETRIEVAL AND DISPLAY SYSTEM AND 
ASSOCIATED METHOD 

BACKGROUND 5 

The present invention is a computerized system and 
method for searching through and retrieving information 
from an informational resource; and more particularly, the 
present invention is an information management, retrieval 
and display system for searching through an informational 10 
resource and for displaying the results of the search in an 
collapsible/expandable format based upon a user-selected 
display criteria or hierarchy. 

An inherent drawback in many conventional search J5 
engines or search tools, such as Infoseek™, AltaVista™, 
Hotbot™, is that the results of the search are typically 
organized according to the number of hits that the search 
word or phrase made in each document (Web page) being 
searched. This type of search result display requires the end 2Q 
user to go through the bits one by one in order to finally 
access the document he/she was looking for. Another draw- 
back with such conventional search engines is that the 
results of the search do not take into account that a word may 
have several different meanings, and may be used in many ^ 
different contexts. For example, if an end user were looking 
for information on a cartoon mouse, because the search 
query would contain the word "mouse," the list of hits will 
include documents for the electronic cursor-control devices, 
documents providing biological information on mice, docu- ^ 
merits providing pet information on mice, etc. Therefore, the 
end user may have to go through an enormous number of 
these hits before finally (if ever) reaching a hit related to the 
cartoon mouse. 

Thus, there is a need for a search engine or search tool that 35 
allows the end user to that arranges the search results in a 
manner that allows the end user to effectively and quickly 
obtain items of interest 



SUMMARY 



40 



The present invention is an information management, 
retrieval and display system for searching through an infor- 
mational resource, such as a document (e.g., a treaty), a 
number of individual documents (e.g., Web pages resident 
on the Internet), or a stream of information (e.g., DNA code, 45 
source code, satellite data transmissions, etc.) and for dis- 
playing the results of the search in an collapsible/expandable 
format based upon a user-selected display criteria or hier- 
archy. Such a display hierarchy will allow the end-user to 
effectively and quickly obtain items of interest from the so 
search results. The type or format of the informational 
resource is not critical. 

The invention includes four primary modules, a break 
module, an indexing module, a search module and an 
un-break module. The break module is an expert system 55 
operating upon a set of expert rules that define its operation. 
Hie break module parses through the informational resource 
to break up the informational resource into finite elements 
(such as paragraphs, sections, sub-sections, segments etc.). 
The break module also creates categorical tags for each of 60 
these finite elements, where the categorical tags assigned to 
each of the finite elements are based upon and analysis 
(defined by the set of expert system rules) of the contents of 
each of the finite elements. The categorical tag can include 
a standard classification such as, for example, "Dewey 65 
Decimal-type" number. The categorical tag can also include 
an organizational attribute (such as pertaining to the type or 



location of the finite element with respect to the rest of the 
rest of the informational resource), a date-stamp, a categori- 
cal word, etc. Preferably, the categorical tags are inserted 
into the finite element. 

The index module parses through the finite elements 
identified/crea ted/processed by the break module and cre- 
ates a searchable database having a database record for each 
of the finite elements identified by the break module. The 
searchable database is a type of reverse index, where each 
record includes an address or location of the corresponding 
finite element (and, in turn, the categorical tag included 
therewith), and strings (such as words, phrases, etc.) con- 
tained in the finite element and their frequency (i.e., their 
weight) within the finite element. 

In applications where the users of the invention do not 
have control of the information being searched (i.e., Web 
pages on the Internet), each .database record may also 
include the categorical tag, since the categorical tag will not 
be able to be inserted by the break module into the finite 
elements themselves. Furthermore, with the Web search 
application, it may not be necessary to utilize the break and 
un-break modules at all since each Web-page or link might 
be considered a finite-element for the purposes of the present 
invention. 

Once the reverse index is created, a search of the reverse 
index may be performed. Key strings (such as key words, 
phrases or symbol segments) may be supplied by an end user 
as a search query, and a display hierarchy or criteria may 
also be selected or defined by the user. The selected display 
criteria will instruct the search module how to manipulate 
the data of the search results. Specifically, the selected 
display criteria will define if the search results are to be 
displayed in an order or structure based entirely upon the 
information contained within the categorical tags (research - 
centric), if the search results are to be displayed in an order 
depending entirely upon the frequency of the key strings 
present in the finite elements (conventional), or if the search 
results are to be displayed in an order or structure based 
upon a combination of the two (document-centric). 

The search module accesses the search query and searches 
through the reverse index for database records matching the 
specific search term or query. The search results are then 
displayed in an collapsible/expandable (tree) structure by 
applying the information in the categorical tags for each of 
the finite elements satisfying the search criteria to the 
selected display hierarchy. For example, if the selected 
hierarchy is a document-centric hierarchy, a first level of the 
display hierarchy may be, for example, the year in which the 
finite element was created; a second level of the display 
hierarchy may be, for example, the order in which the finite 
elements appear in the document; and a third level of the 
display hierarchy may be, for example, based upon the 
frequency in which the search words appear in each of the 
finite elements. The operation of the search module, as with 
the break and index modules, is based upon a set of expert 
rules. Therefore, if the search results are not satisfactory, the 
expert rules in the break, index and/or search modules are 
modified and the procedure is performed again. 

Once one of the finite elements in the search result display 
are selected by the end-user, the un-break module allows the 
end user to view a contiguous portion of the informational 
resource that the selected finite element belongs to. The 
un-break module will assemble selected finite element with 
other related finite element to reconstruct the contiguous 
portion of the informational resource. The un-break module 
refers to the categorical tag of the selected finite element for 
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information related to the location of the finite element with inserted into the database record if there is a significant 

respect to the entire informational resource, and will then difference. Likewise, the index module can be configured to 

build a portion of the informational resource from all of the filter out Web sites having undesirable or unsavory content 

finite elements belong to that portion. For example, if the as indicated by the embedded categorical tags or as deter- 

selected finite element is a paragraph of a document, the 5 mined by a review of the content of the Web page itself, 

iin^ikmodiite^ ^ olh€T embodiment of ^ Mention, the dynamic 

the document to which the paragraph belongs. As with the ^ ^ ^ ^ d to accept and mdex ^ 

other modules of the present invention, the operation of the maimer of static ^ information (such as news- 

un-break module is controlled by a set of expert rules that fec ^ ^ transmissions, etc) on a global scale where an 

may be modified if the results are unsatisfactory. io end-user will be able to efficiently and quickly obtain any 

It is envisioned that the rule sets will be created and ^ of information he/she wishes from a hierarchal search 

refined by an expert on the document or information being ^nlt display based upon a categorical organization scheme 

searched. For example, if the system of the present invention sucrj ^ ^ Dcwey-Dccimal system. 

is to be associated with \folume 37 of the Code of Federal ~- _ Ct , . . , , 

„ , . . . . « . , , x . . Thus, m one aspect or Ihe present invention, a method for 

Regulate mdmdual (or group of .ndividuab) w, h ,5 information from an informational resource com- 

intimate knowledge of the volume would be best suited to v ° . c / v , . . . , . , 

- - - . ^ „ - ^ . . p rises the steps ot: (a) dividing the informational resource 

generate and fine-tune the rule sets. The fine-tuning of the f . . fC V . f « v . . . . 

& . ^ ... . . . . . b r into a plurality of finite elements; (b) assigning a categorical 

rule sets would involve the individual continuously perform- t 4 r . J r , 7V> 4 , * . , & 4l _ 

. . • 1 j tag to each of the plurality of finite elements, where the 

ine example searches on the vfohime using the rule sets, and ~ . t 4 . /, , ~ _ . . 4 ' 4 - 4 , 

° . r . .... . . ........ categoncal tag includes data pertaining to a content of the 

continuously modifying the rule sets to obtain the search 20 * ■ i r \ r . , , 4 . , 

1. .1. j-j * ^ .u 1 finite element; (c) generating a searchable database record 

results with the desired content and format. Once the rule , . * 1 r ^ t . . . 

. . . - ^ j 4 , . j , riL . for each of the plurality of finite elements, where each 

sets have been fine-tuned, the search module of the present . , , . . . . , 4 . 4 . 

. , t_ j 1 - lL ,l iLi j u searchable database record includes at least one string 

invention can be packaged along with the volume and sold . . , ... - . . , . T 

,. A , *^ l i_i w 1 t m * i_ contained within the finite element, where the string can be 

or distributed as a searchable vbhime. Likewise, the search , . - , , , 4 

. n . . . 4 ... a word, a phrase, a symbol, a group of symbols, a data 

module could operate on a Web-site so that users can access 25 4 4 . 7v /j . ✓ \ . 

*u t»7 u j _r 4 . , e - .u segment or the like; (d supplying a search string; (e) search- 

the Web-site and perform searches on the Volume. Since the . °\1 . ' v 4 . rr i . . *T . 

, , . .. , r , - , •, . mg the searchable database for searchable database records 

rule sets have already been defined and fine-tuned by the 4 . . 4| _ , 4 . /£v 4 , l4 _ r 4l _ 

a „ 4l _ u i_ 1 11 i_i i_ containing the search string; (1) arranging the results of the 

experts, the users would have a fully operable search ^T- 4 . . ^ . ' 4 , 

• 7l . _r_ . j j- 1 u ■ searching step m a hierarchal structure according, at least in 

engine that performs searches and displays results in accor- ^ ~~f , \ . 4 , 4 . . 4 , 4 4| _ c . 

r ^, . . r . parL to the data m the categoncal tags assigned to the unite 

dance with an expert s intimate knowledge with the volume. 30 , . , • 4 , . ^T.- * 4l _ 

, r , . . , .... ... elements found in the searching step; and (g) displaying the 

As mentioned above, it is also envBioned I thtf an embodi- of ^ x&icbi ^ ia tae hierarchal structure, 

ment of the invention is designed to search through a number ^ . „ . , . , . 

of individual Web pages resident on the Internet and to "if^ttonal resource may be a single document, a 

display Ihe results of the search in an collapsible/expandable if"^ of d" 0 ™^ ° r a s 1 " 3111 of data and the step of 

format based upon a user-selected display criteria or hier- 35 rientifying the fmite elements may include the steps of 

archy. In such an embodiment, a break module in the form **<*>*P*S sections or sub-sections witmn the document(s) 

described above may not be necessary because each Web ° r ^ stIC ™ " ld f I,, ^ mg * e d 1 ocuments 

page may already be considered to be a "finite element" and themselves. Tlje step of dmding the informational resource 

the search engine will not be able to modify the Web pages. mt0 a P 1 "^ of finite elements is preferably performed by 

Accordingly, in such an application, the index module will 40 an ex P ert svstem accordm S to a ^ and ,? he s ' e P ° f 

parse through each of the Web pages (finite elements) to categoncal tag to each of the plurality of finite 

create a searchable database having a record for each of the e ^ e " ts 15 3)50 P«fe«bly performed by an expert system 

Web pages. Each record in the searchable database will *«ording *> another rule set. If unsatisfactory results are 

include the Web address of the Web page, non-common obtained in step (g) above, one or both of the rule sets may 

words contained in the Web page along with their frequency 45 £ m °J^ y thCCnd 30(1 the steps (a) through (g) may 

(weight), and a categorical tag, as described above, which pertormeo again. 

includes data based upon an analysis of the contents of the Each database record preferably includes an address or 

Web page. The index module will also review each of the pointer to the corresponding finite element and further 

Web pages to determine if the creator of the Web page had preferably includes all of the non-common strings (e.g., 

embedded a categorical tag into the Web page itself; and if 50 words or phrases) contained within the corresponding finite 

such an embedded categorical tag is found, the index module element along with the frequency that such strings appear, 

may simply insert this pre-defined categorical tag into the In another aspect of the present invention, a method for 

database record rather than creating one itself. Therefore, as retrieving information from an informational resource 

the present invention becomes more prevalently used on the includes the steps of: defining a first rule set for dividing the 

Internet, Web page creators may desire to create their own 55 informational resource into a plurality of finite elements; 

categorical tags for their Web pages rather than having the utilizing the first rule set, dividing the informational 

search engine create one for them. With this feature, the Web resource into a plurality of finite elements; defining a second 

page designer may be able to influence the search results, rule set for creating a categorical tag for one of the plurality 

perhaps to achieve a more accurate depiction of the Web site. of finite elements; utilizing the second rule set to create a 

Of course, such a feature may also be used by the Web 60 categorical tag for each of the plurality of finite elements; 

designers in a deceptive manner, where the categorical tag generating a searchable database including a searchable 

will cause the Web page to be listed in search results when database record for each of the finite elements; searching the 

the searcher is looking for an entirely different type of searchable database for relevant database records; associat- 

information. Recognizing this potential problem, the index ing the relevant database records found in the search with 

module will include an option where it will compare the 65 corresponding relevant finite elements; selecting a hierarchy 

actual contents of the Web page against the embedded for displaying identifying phrases pertaining to the relevant 

categorical tags, and will create a new categorical tag to be finite elements; ordering the relevant finite elements in the 
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hierarchy according, at leas! in part, to the categorical tag for 
each of the finite elements; and displaying the identifying 
phrases pertaining to the relevant finite elements according 
to the results of the ordering step. 

In another aspect of the present invention, a data storage 
device (such as a CD ROM) is provided, which comprises: 
an informational resource divided into a plurality of finite 
elements, where each of the finite elements includes a 
categorical tag and a database record assigned thereto, where 
the categorical tag includes data pertaining to a content of 
the finite element and the database record includes at least 
one string contained within the finite element; and also 
comprises software instructions programmed to retrieve and 
display at least a portion of the informational resource. The 
software instructions are configured to perform the steps of: 
supplying a search string, searching through the database 
records for relevant database records containing the search 
string, arranging the results of the searching step in a 
hierarchal structure according to the information in the 
categorical tags assigned to the finite elements correspond- 
ing to the relevant database records, and displaying identi- 
fying phrases for the finite elements corresponding to the 
relevant database records in the hierarchal structure. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a schematic flow-diagram representation of the 
operation of a first embodiment of the present invention; 

FIGS. 2A and 2B are flow-chart representations of the 
operation of the embodiment illustrated in FIG. 1; 

FIG. 3 is a flow-chart representation of an operation of a 
second embodiment of the invention, resident on a data 
storage device such as a CD ROM; 

FIG. 4 is a schematic flow-diagram representation of the 
operation of a third embodiment of the present invention; 
and 

FIGS. 5A and 5B are flow-chart representations of the 
operation of the embodiment illustrated in FIG. 4. 

DETAILED DESCRIPTION 

The present invention is an information management, 
retrieval and display system for searching through an infor- 
mational resource, such as a document (e.g., a treaty), a 
number of individual documents (e.g., Web pages resident 
on the Internet), or a stream of information (e.g., DNAcode, 
source code, satellite data transmissions, etc.) and for dis- 
playing the results of the search in an collapsible/expandable 
format based upon a user-selected display criteria or hier- 
archy. Such a display hierarchy will allow the end-user to 
effectively and quickly obtain items of interest from the 
search results. 

As shown in FIG. 1, in a first embodiment of the 
invention, the information management, retrieval and dis- 
play system includes four primary modules, a break module 
10, an indexing module 12, a search module 14 and an 
un-break module 16. Each of these processing modules are 
preferably expert engines operating upon a set of expert 
rules that define the operation of the individual module. As 
will be described in further detail below, the expert rules for 
these modules are preferably generated by a person or 
persons having intimate knowledge of the document or 
documents 18 being searched; and the fine tuning of the 
expert rules is a iterative process where the expert will 
modify or change the rules of one or more of the above 
modules if a search through the document or documents 
proves to be unsatisfactory. 
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The break module 10 parses through an informational 
resource, such as a group of documents 18 to break up the 
group of documents into "finite elements" 20a-2Q2. Each 
finite element is a user-defined "basket" of information from 
5 documents that is to be individually indexed and searched. 
The finite element is usually not a single word, phrase or 
symbol, but is a section or portion of an informational 
resource that can be identified and isolated by the break 
module. A simple example of a finite element would be the 
10 individual paragraphs of a document Other examples of 
finite elements would include sub-chapters of a document, 
individual pages of a document, and other types of identi- 
fiable sections of a document. In some instances, the finite 
element can be the entire document itself. The break module 
15 is also responsible for analyzing the contents of each finite 
element 20o-20 z and creating a categorical tag 22a-22z for 
each finite clement, which is to be inserted into the finite 
clement. The categorical tags 22a-22z may include a stan- 
dard classification based upon the content analysis such as, 
23 for example, a "Dewey Decimal 7 * type number, or some 
other categorical reference number. The categorical tag may 
also include an organizational attribute such as pertaining to 
the type of finite element or the location of the finite element 
within the document, a date stamp, a categorical word or 
25 phrase summarizing the contents of the finite element, etc. 
As will be discussed in detail below, the contents of each 
categorical tag provides information to the search module 12 
so as to assist the search module in creating the hierarchical 
display of the search results. 
30 The index module parses through each of the finite 
elements created by the break module and creates a search- 
able database 23 including a database record 24a-24z for 
each of the finite elements created by the break module. The 
searchable database 23 is a type of reverse index, where each 
35 record 24a-24z includes an address or location of the 
corresponding finite element and all words contained within 
the finite element (preferably excluding common words such 
as "and," "in,** "the,** . . . ) along with their frequency of 
appearance within the finite element (Le., their weight). 
40 At some point during the process, a user, which may be 
an end user or may be the expert developing the rule sets, 
will enter a search query 26 and an optional hierarchical 
selection 28. The search query may be any conventional 
search query as available to those of ordinary skill in the art 
45 and may include search words or phrases and/or operators 
tying the words together. The hierarchy selection informs the 
search module the type of display format that the user wishes 
to see the results displayed within. Specifically, the hierar- 
chy selection will inform the search module whether or not 
50 the search results are to be displayed in an order or structure 
based entirely upon the information contained within the 
categorical tags (research-centric), if the search results are to 
be displayed in an order depending entirely upon the fre- 
quency of the key words or phrases present within the finite 
55 elements (conventional), or if the search results are to be 
displayed in an order or structure based upon a combination 
of the two (document-centric). 

The search module will utilize the search query to search 
through the database records 2Aa-2Az so as to find the 
60 database records 30 matching the words or phrases in the 
search query. The search module will then, depending upon 
the selected hierarchy 28, display the search results 32 in an 
order or collapsible/expandable tree structure based upon 
information from the categorical tags 22 included in the 
65 finite elements 20 that are associated with the records 30 
matching the search query. For example, a first level of the 
display hierarchy might be ordered according to the chapters 
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of a document that the finite elements are contained within. categorical tags of the remaining finite elements to deter- 

lnfonnation regarding the chapters that the finite elements mine the other finite elements 36 that appear on the same 

are contained within will be resident within the categorical page as the selected finite element so as to display the entire 

tags associated with the finite elements. A second level of the page 38 rather than the single paragraph. Likewise, the 

display results may order the finite elements for each chapter 5 un-break module can group related finite elements together 

based upon the weight or frequency that the search words or m a contiguous chapter, section, or other contiguous iden- 

phrases appear within each finite element Therefore, on the msbh portion of the document or documents. Simply put, 

search results screen the end user will select which chapter ^ module is used for displaying the selected finite 

he or she element ^ OTntext ^ ^ remainiQg portions of the 

and the display will then expand to show the finite elements 10 - f , w 

* . t . i_ . . . , _ . lntormatronal resource, 
from that chapter matching the search query. These finite 

elements contained within this chapter will be ordered While it is preferred that the un-break module is utilized 

depending upon the weight of the search query or words. to reconstruct contiguous portions of the informational 

From there, the user will make a selection 34 indicating to resource, it is within the scope of the invention to configure 

the un-break module 16 which of the finite elements the user is the expert rule sets of the un-break module to construct new 

wishes to view. informational resources using the selected finite elements 

It will be appreciated by those of ordinary skill in the art and other finite elements from the original informational 

that the different combinations of ordering schemes and tiers resource. For example, the un-break module may be con- 

for any given hierarchy is virtually limitless. Other ejamples figured to compile all of the finite elements matching the 

of ordering schemes and tiers can be based upon the topic of 20 search query into a new informational resource, using the 

the finite element, the author or provider of the finite categorical tags for these finite elements to dictate the order 

element, the time/date of the finite element, the position of in which the finite elements will be compiled. In another 

the finite clement with respect to the information resource, example, the un-break module may be configured to review 

etc. It is also within the scope of the invention that the the categorical tag of the selected finite element to determine 

hierarchy only include one tier or level of ordering. 25 other finite elements that are related to the selected finite 

While it is preferred that the search module displays the element based on the date that the finite elements were 

search results in an coUapsible/exparxlible tree structure, it created, or the author/owner of the finite element, or the 

is also within the scope of the invention that the display content of the finite element; and the un-break module will 

results be displayed in alternate hierarchal or tiered struc- then construct a new informational resource compiling all of 

hires. An example of an alternate hierarchal/tiered structure 30 the related finite elements. 

is the use of a cascaded or tiled display to present the various FIGS. 2A and 2B provide a flow chart representation of an 

evek of the hierarchy ^Of course, if there is only one tier or uon of me in f orma tion management, retrieval and 

level ol .ordering, the ^ display structure would not need to be dis play system for me embodiment described above. As 

coUapsible/expandible. 35 shown in functional block 40, a first step is to access the 

The search module may also be configured to recognize informational resource being examined. As illustrated in 

that a string in the search query may have other permeations, functional block 42, the next step is to select the appropriate 

which may be used by the search engine to provide matches expert rule sets to apply for searching through the informa- 

with the database records. For example, if the search query rional resource. The particular rule set selected will depend 

includes a word in a first language, it is within the scope of ^ upon the type of information resource that was accessed in 

the invention for the search module to provide the word in step 40. For example, a set of expert rule sets used for 

other languages when looking for matches with the database searching through and analyzing the Antarctic Treaty will be 

records. Likewise, it is within the scope of the invention for different than a set of rule sets used for analyzing and 

the search module to provide other known forms or teases of searching through volume 37 of the Code of Federal Regu- 

the word; and it is also within the scope of the invention for 45 lations. As shown in functional block 44, the next step is to 

the search module to provide other search words having a break the information resource into a plurality of finite 

similar or the same meaning. elements according to a first set of the expert systems rules. 

The un-break module 16 accesses the categorical tag of As discussed above, this step involves breaking the infor- 
the selected finite element 34 to determine the other finite mational resource into identifiable segments of information 
elements 36 of the documents 18 that are to be grouped 50 such as paragraphs, subsections, pages, chapters, subchap- 
together so as to form the single contiguous display 38. For ters and the like. An example rule set for breaking the 
example, if the selected finite element 34 is a paragraph of Antarctic Treaty into a plurality of finite elements is pro- 
file document, the un-break module 16 will refer to the vided below in Table 1. 



TABLE 1 

FINISHED EXAMPLE OF A 'RULE SET* FOR AUTOMATICALLY DIVIDING 
DOCUMENTS INTO SEGMENTS OR ELEMENTS 1 



SPECIFIC 
DOCUMENT DOCUMENT 
DIVISION LEVELS DIVISIONS 



PATTERN MATCHING RULES 



Primary Level Antarctic Treaty, Recognize by bolded large foots centered on f, 

Conventions, Protocol 
and its Annexes 

Secondary Level Recommendations, Recognize by Roman numerals 

Measures, etc 
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FINISHED EXAMPLE OF A 'RULE SET FOR AUTOMATICALLY DIVIDING 
DOCUMENTS INTO SEGMENTS OR ELEMENTS 1 

SPECIFIC 
DOCUMENT DOCUMENT 

DIVISION LEVELS DIVISIONS PATTERN MATCHING RULES 



Tertiary Level Articles within Recognize by medium fonts centered on page with a colon 

documents from the 
primary or secondary 
levels 

Grouped Level Antarctic Treaty Group documents by their Roman numerals 

Consultative Meeting 

Appended Level Year Append the signature date far documents at the primary, 

secondary or grouped levels 



'Based on the public-domain documents in the Antarctic Treaty Hand boot which has been published since 
the 1960's by the United States Department of State in hardcopy form only and which now has been con- 
verted into a searchable database, 

2 Source codes arc described using JAVA but could easily be written in PERL or any other programming lan- 
guage. See Appendix A for example source code segments 



As shown in the above tabic, the example rule set is 
adapted to divide the Antarctic Treaty into a plurality of 
levels where a primary level of the Treaty, which involves 
the Antarctic Treaty, Conventions, Protocol and its Annexes, ^ 
is recognized by the search engine by identifying bold, large 
font centered on a page. A secondary level, which are the 
Recommendations and Measures contained within the 
Treaty, arc recognized by the search engine by identifying 
Roman numerals. A tertiary level is utilized to divide up the 
primary and secondary levels into smaller finite elements. 30 
This tertiary level of finite elements is recognized by the 
search engine by identifying medium fonts centered on a 



iterative manner utilizing feed-back loops as will be 
described below. 



As shown in functional block 46, a next step is to create 
a categorical tag for each of the finite elements based upon 
a positional and/or content analysis of the finite element 
according to a second set of expert system rules. An example 
of a rule set for defining categorical tags for finite elements 
extracted from the Antarctic Treaty is provided below in 
Table 2. 



TABLE 2 



EXAMPLE OF CATEGORICAL TAOS THAT WERE AUTOMATICALLY ATTACHED 
TO FINITE ELEMENTS CREATED WITH THE USER-DEFINED 'RULE SETS' 
(SEE TABLE lV 

DOCUMENT 



DIVISION LEVELS 


SPECIFIC DOCUMENT DIVISIONS 


Primary Level 


Antarctic Treaty, Conventions, Protocol and its Annexes 


Secondary Level 


Recommendation, Measures, etc 


Tertiary Level 


Articles within documents from the primary oi secondary levels 


Grouped Level 


Antarctic Treaty Consultative Meeting 


Appended Level 


Year 



1 Based on the public-domain documents in the Antarctic Treaty Handbook which has been 
published since the 1960*8 by the United States D ep ar tm ent of State in +hardcopy form only 
and which now has been converted into a searchable database. 

2 Source codes axe described using JAVA but could easily be written in PERL or any other 
programming boguagc. Sec Appendix A for example source code segments. 



page with a colon. The remaining levels of the table should 
be apparent to those of ordinary skill in the art upon 
analyzing the table and the associated pattern matching 
rules. 55 

Accordingly, the purpose of the above rule set is to create 
an automatic tool for matching patterns that distinguish 
hierarchies, segments or elements within any type of infor- 
mational resource. The rule set is developed in relation to 
user-defined requirements for the segments or elements that 60 
need to be indexed and searched within the informational 
resource. It will also be apparent to those of ordinary skill in 
the art that the rule sets will be greatly simplified in 
informational resources that include already distinguished 
segments or elements, such as in separate columns or blocks. 65 
Preferably, the rule sets are designed by an expert having 
intimate knowledge of the informational resource, in an 



As shown in Table 2, the categorical tag will include 
notation indicating the finite element's position within each 
of the various identified levels of the Antarctic Treaty. For 
example, the categorical tag will include information indi- 
cating if on a primary level, the finite element is contained 
within the Antarctic Treaty, the Conventions, the Protocol or 
its Annexes. On a secondary level, the categorical tag will 
indicate whether or not the finite element is included in the 
Recommendations, Measures, etc. As shown in the bottom 
of the table, the categorical tag for each of the finite elements 
will also include a content base notation indicating the year 
that the particular section or finite element was created. Of 
course, the type and variations of positional and/or content 
base notations included in the categorical tags are virtually 
limitless. For example, the rule set may be configured to 
analyze the contents of the finite clement so as to provide a 



07/16/2003, EAST Version: 1.04.0000 



US 6,1' 

11 

categorical word or phrase which provides a chie to the user 
as to the contents of the finite element. Similarly, rather than 
utilizing a word or phrase, the rule set can analyze the 
contents or position of the finite element to provide a 
categorical reference number to the finite element, such as a 
Dewey Decimal type number. 

As shown in functional step 48, a next step is to insert the 
categorical tag created above in step 46 into the finite 
element created in step 44. As shown in functional block SO, 
a next step is to generate, for each of the finite elements, a 
searchable database record. Each database record preferably 
contains the noncommon strings (e.g., words* phrases, 
symbols) contained within the finite element along with their 
frequency (i.e., weight). Furthermore, each database record 
will include an address, location or link to the corresponding 
finite element. As shown in functional, block 52, a next step 
is to enter a search string such as a word, phrase or symbol(s) 
and to select a display hierarchy. As shown in functional 
block 54, a next step is to search through the database 
records created in functional block 50 for matches between 
the search string and the noncommon strings of the database 
records. This searching step will identify the relevant data- 
base records having noncommon strings matching the search 
string. As shown in functional block 56, the relevant data- 
base records found in the searching step 54 will be ordered 
by applying information from each of the categorical tags of 
the relevant database record's associated finite element to 
the selected display hierarchy and/or by applying the weight 
of the matching search strings in the relevant database 
records to the selected display hierarchy. 

For example, a first level of the display hierarchy for the 
Antarctic Treaty might be the year that the finite element was 
created; the second level might be ordered according to the 
order of the Articles of the Antarctic Treaty; and a third level 
of the display hierarchy might be ordered according to the 
weight of the matching strings contained within the database 
records. 

As shown in functional block 58, a next step would be to 
display the search results in the collapsible/expandable 
hierarchy on a display screen. As shown in functional block 
60, the user will determine whether the search results were 
satisfactory, and if not the process will advance to functional 
block 62 where the user will modify one or more of the rule 
sets and will return either to functional block 44 or to 
functional block 52 depending upon which rule sets have 
been modified. 

If, in functional block 60, the search results are 
satisfactory, the process will advance to functional block 64 
where the user will select one of the finite elements from the 
search results display. Then in functional block 66, the 
categorical tag of the selected finite element will be used to 
identify other finite elements that are to be grouped together 
with the selected finite element to create a contiguous 
portion of the informational research to be displayed. 
Finally, in functional block 68, the contiguous portion of the 
informational resource will be displayed on the display 
screen or printed. 

It is envisioned that an expert having intimate knowledge 
of the informational resource will develop the rule sets based 
upon his or her knowledge of the informational resource. 
Thereafter, once the rule sets have been fully developed, the 
feed-back portion of the above-described flow chart will no 
longer be necessary. 

Furthermore, once the rule sets have been fully 
devebped, the search module, the un-break module and the 
fully developed rule sets may be incorporated onto a data 
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storage device (such as a CD ROM, a disk-drive, a floppy- 
disk arid the like) along with an informational resource 
pre -broken into its plurality of finite elements, where each of 
the finite elements includes the corresponding categorical 

5 tag previously created therefore, along with the pre -created 
searchable database for the plurality of finite elements. 
Therefore, such a storage device would essentially provide 
a searchable document that includes the entire content of the 
informational resource along with a search engine that has 

10 been fined tuned by an expert with intimate knowledge of 
the informational resource, so that end users of the CD ROM 
(or other type of storage device) would be able to take 
advantage of the expert's knowledge and experience in 
searching through the informational resource contained 

15 therewith. 

As shown on FIG. 3, a flow chart representation of an 
embodiment of the invention resident on a data storage 
device, such as a CD ROM, is presented. Essentially, this 
embodiment is equivalent to the embodiment described in 

20 FIGS. 2A and 2B above, except that the development of the 
rule sets are not longer required As shown in functional 
block 52', a first step would be for the end user to enter a 
search string and select a display hierarchy. In functional 
block 54', the next step would be for the search module to 

25 search through the database records contained on or down- 
loaded from the CD ROM to match the search string with the 
non-common strings contained in the searchable database 
records. As shown in functional block 56* the next step 
would be for the search module to order the search results by 

30 applying information in the categorical tags of the matching 
finite elements (which are contained in, or are downloaded 
from the CD ROM) and/or by applying the weight of the 
matching strings to the selected display hierarchy as dis- 
cussed above. As shown in functional block 58' the next step 

35 is to display the search results in preferably a collapsible/ 
expandable hierarchy. As shown in functional block 69, the 
end user, upon viewing the search results will determine 
whether or not the results are satisfactory. If not satisfactory, 
the process will return to functional block 52* where the end 

40 user will input a new search string and/or will select a new 
display hierarchy. If the display results of step 58* are 
satisfactory, the process will advance to functional block 64' 
where the end user will select one of the finite elements from 
the search results display. Advancing to functional block 66* 

45 the un -break module will reconstruct the portion of the 
information resource that includes the selected finite element 
by accessing the selected finite element and the other 
surrounding or related finite elements from the CD ROM to 
create the contiguous portion of the informational resource 

50 that included the finite element. 

In another embodiment of the present invention the infor- 
mation management, retrieval and display system may be 
specifically configured to search through a number of indi- 
vidual Web pages resident on the Internet and to display the 

55 results of the search in an collapsible/expandable format 
based upon a user selected display criteria or hierarchy. In 
such an embodiment, a break module in the form described 
above may not be necessary because each Web page may 
already be considered a "finite element" and the search 

60 engines will not be able to modify the Web pages. Therefore, 
with such an embodiment, the search engine will also not be 
able to insert the categorical into the finite elements. 
Therefore, in this embodiment, the categorical tags may be 
either stored separately from the finite elements or incorpo- 

65 rated directly into the database records. Furthermore, it is 
envisioned that the Web page creators may desire to create 
their own categorical tags for their Web pages rather than 
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having the search engine create ace for them. With this 
feature, the Web page designer maybe able to influence the 
search results, perhaps to achieve a more accurate descrip- 
tion of his or her Web site. Of course, in such a feature may 
also be used by the Web designers in a deceptive manner, 
where the categorical tag will cause the Web page to be 
listed in search results when the searcher is looking for an 
entirely different type of information. Recognizing this 
potential problem, the index module may include an option 
where it will compare the actual contents of the Web page 
against the embedded categorical tags inserted by the Web 
page designer, and may create a new categorical tag to be 
inserted in the database record for the Web page if there is 
a significant difference between the two. Likewise, the 
search engine can be configured to include an optional filter 
that will filter out Web sites having unsavory contents as 
indicated by the embedded categorical tags or as determined 
upon a review of the content of the Web page itsel£ 

As shown in FIG. 4, in such an embodiment of the 
invention, the information management, retrieval and dis- 
play system includes two primary modules, an index module 
70 and a search module 72. Each of these processing 
modules are preferably expert engines operating upon a set 
of expert rules that define the operation of the individual 
module. The index module 70 will periodically crawl 
through the volume of Web pages 74 utilizing a conventional 
Web crawling or Web searching technology such as a spider 
technology, which is adapted to examine each Web page (or 
as many as possible) provided on the Internet. As shown in 
FIG. 4, several of the Web pages may include a predefined, 
embedded categorical tag 76 included therewith. As dis- 
cussed above, such an embedded tag 76 would be inserted 
in the Web page by the Web page designer so that the search 
engine of FIG. 4 would utilize tms predefined embedded 
categorical tag rather than creating one on its own. An 
example of a rule from the expert role set for defining the 
categorical tag in this embodiment would be to identify the 
most prominent word or phrase on me initial screen appear- 
ing when the Web site is accessed. 

The index module 70 will also create a searchable data- 
base 78 including a database record SOaSQz for each of the 
Web pages accessed above. This searchable database 78 is a 
type of reverse index or each record 80a-8Qz includes a link 
to a corresponding Web page, all words contained within the 
Web page (preferably excluding common words) along with 
their frequency of appearance within the Web page, and a 
categorical tag created by the index module or a copy of the 
categorical tag that was included in the particular Web page 
as described above. It is envisioned that the index module 
would constantly be re-accessing the Web pages 74 and 
updating the searchable database 78, since the contents of 
Web pages are also constantly being updated or changed. 

When a user wishes to conduct a search using the search 
engine, the user will enter a search query 82 and select an 
optional hierarchical selection 84. The search query may be 
any conventional search query as available to those or 
ordinary skill in the art, it may include a search word or 
phrases and/or operators tying the words together. The 
hierarchy selection informs the search module the type of 
display format that the user wishes to see the results dis- 
played within. Specifically, the hierarchy selection will 
inform the search module whether or not the search results 
are to be displayed in an order or structure based entirely 
upon the information contained within the categorical tags 
(research-centric), if the search results are to be displayed in 
an order depending entirely on the frequency of the key 
words or phrases present within the finite elements 
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(conventional), or if the search results are to be displayed in 
an order or structure based upon a combination of the two 
(document-centric). 

The search module 72 utilizing a search query 82 to 

5 search through the database records, 80aSQz so as to find 
the database records 86 matching the words or phrases in the 
search query. The search module will then, depending upon 
the selected hierarchy 84, display the search results 88 in an 
order or in a collapsible/expandable tree structure based 

10 upon information from toe categorical tags 89 included 
within the database records 87 matching the search query. 
From the display 88, the user will make a selection 90 of a 
link to a Web page that he or she wishes to view and the 
search module will then display the Web page 92 on the 

15 display screen. 

FIGS. 5A and 5B provides a flow chart representation of 
an operation of the embodiment described above in FIG. 4. 
As illustrated in the function block 94, a first step is to access 
a Web page on the Internet. In functional block 96, the next 

20 step is to determine whether the access Web pages includes 
an embedded categorical tag. If the Web page includes an 
embedded categorical tag the process would advance to 
functional block 98 where the process will determine 
whether the embedded categorical tag is consistent with the 

25 content of the Web page. If the Web page does not include 
an embedded categorical tag or if the categorical tag is not 
consistent with the content of the Web page, the process will 
advance to functional block 100 where a categorical tag will 
be created for the Web page. If the embedded categorical tag 

30 is consistent with the content of the Web page in step 98 or 
if the categorical tag is created for the Web page in step 100, 
the process will advance to functional block 102 where a 
searchable database record will be generated for the Web 
page. This searchable database record will include the 

35 non-common words or phrases contained within the Web 
page and their frequency (i.e., weight) a link to the Web page 
and the categorical tag embedded within the Web page or 
created in step 100 above. The process will then advance to 
functional block 104 to determine whether a next Web page 

40 is to be accessed. If so, the process will return to functional 
block 94. If the searchable database is complete, the process 
will advance to functional block 106 where a user will enter 
a search word or phase in selected display hierarchy. . 
Advancing the functional block 108, the search engine 

45 will search through database records for matches between 
the search word or phrase and the non-common word or 
phrases contained within the database records. Advancing to 
functional block 110 the search engine will then order the 
results of the search by applying the information in the 

50 categorical tags matching database records to the selected 
display hierarchy and/or by applying the weight of the 
search word or phrase in each of the matching database 
records to the selected display hierarchy. Advancing to 
functional block 112, the next step would involve displaying 

55 the search results on the display screen. In functional block 
114, if the search results are satisfactory, the user will select 
a Web page link on the display screen and the search engine 
will display the associated Web page selected. If the search 
results are unsatisfactory, the process will advance to func- 

60 tional block 118 where the user will enter a new search word 
or phrase and/or select a new display hierarchy and the 
process will return to functional block 108 so that another 
search can be performed. 

In the present embodiment, the expert rule sets for cre- 

65 ating the categorical tags, and the database records may be 
defined by an expert utilizing an iterative variation of the 
above process on a limited portion of the Internet (similar to 
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that as described in FIGS. 2A and 2B above). Once the rule The following Appendix provides example code segments 
sets have been refined, the rule sets can be applied to the for dividing the Antarctic Treaty into a plurality of finite 
entire Internet The above described search engine can be elements and for creating categorical tags for the finite 
operating on a Web site or may be contained in a memory elements 
device such as a CD ROM which can be downloaded onto 5 
a computer having access to the Internet. 

TABLE 1 

1) Sline — m/<PONT FACE^-Hctvcfica, ArialT SIZE^d>((U)V(\dVilV0^?(\^4d)?V?(\d\4d)?)-[a-zA-aVV_\3>y 

2) Sline — m/<P AUGNH'X2hrrTO">{(lDCVLDCM}f)\-([ <}*7))<\P>/ 

3) Sline — m/<P 

AiJGN^E*i^">oextract(s)?^^ 

4) Simple variable substitution*^ a A Hsshtable is generated nsing the code: 

# SUB: gcn_hashcs 
# 

# Purpose: lb generate a tables based oa a set pattern from an 

# files related to the Antarctic Treaty Project 

sub gen_hashes 

my SFUeList - @_[0fc 

print Tilelist: $FUeList<br>Vn"; 
print "Generating Year-Roman Numeral and Year- Major Document Name HashcaAn\n"; 
print "Processing"; 

while (SlnputFN - glob(**$FileList")) 
{ 

open (InFite, "<$InputFhO|)dfc£"Cannot open $InputFN\o$!"); 
print 

while (defined($line - <LnFile>)) 
{ 

chomp(Sline); 

# Remove unwanted characters from the line 
Sline ** rem aU(Sliiic); 

# print Sline; 

# Begin Matching 

# Attempt to grab the last for digits of a line, if they are on the line 

if (Sline — m/*"Aw+\w+\w+\w+^w+\w+\w+), ?(Vw+fw+\w+jVw+\w+),? ?(\djv\d\d)( 
\w+)?-(\d^d\d)(\w+)?\w+(U^W\d\d)/) 
{ 

# Debugging Code 

sprint "$1 $2 S3 $4 $5 $6 $7 $8 $9 SlOVn"; 
SMajorDoc - $1; 
Syeai - S7; 
if ("$year"ne ** ") 
{ 

if (defined($NextUne - <InF0e>» 
{ 

# Remove unwanted characters from the line 
SNextline - rcm_aJ]($NextLine); 

# Match the Roman Numerals at the beginning of the 

# next line 

if($NextLine -~ mT([IVXLCDM>)\) 
{ 

SRomanNumeral - $1; 

$YearTYans{$RomanNumeral} - Syear, 

$DocTrans{$ycar} - SMajorDoc; 
} 

} 

} 

} 

} 

close InFile; 

} 

print "Vn"; 

} 
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60 

Then, when the Dewey tag is written out, the ATCM is 
placed in the appropriate position, whether it be the first, 
second, or third level using the Year from the currently 
matched pattern to obtain the correct ATCM from the 
hashtable (DocTrans table specifically) as generated above. 65 
5) Simple variable substitutionaaa A Hashtable is generated 
using the code: 



# SUB: gen_hashes 
# 

# Purpose: To generate a tables based on a set pattern from an 

# files related to the Antarctic Treaty Project 
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-continued 



} 



} 



} 



} 

} 

close InFile; 

> 

print *^n"; 



10 



sub gen l3 hashes 
{ 

my SFileList - @_fl>fc 

print "FDtllst: SFQeUst<cbT>\n"; 
print "Generating Year- Roman Numeral and 
Year- Major Document Name Hashes.\nVn**; 
print " Processing** i 

while (WnputFN - glob(-$Filelisr)) 
{ 

open (InFUe, 

"<$InputFN^f)dieC*Cannot open $InpulFNVn$!"); 
print* 4 .**; 
while (defined($line - <InFfle>)) 
{ 

chomp($line); 

# Remove unwanted characters from the line 
$Kne • rem__all($line); 

# print $line; 
# Begin Matching 

# Attempt to grab the last for digits of a line, 
if they are on the line 

if (Sine — m/*(\w+\w+\w+\w-fj\w+Vw+\w+), 
?(Vw+ftw+\w+|Vw+\w+),? 7^d|v\d\dX 
\ w+ )7_^d|Vd\d)^w+)Aw^{\a*\dVd\dy) 
{ 

# Debugging Code 

#print "$1 $2 $3 $4 $5 $6 $7 $8 $9 $10\n"; 
SMajarDoc - $1; 25 

Syeai - $7; 
irX"$year" ne " ") 
{ 

if (defined($NextUne - <InFile>)) 
{ 

# Remove unwanted characters from the line 30 
SNextlinc - rem_all($NextLine); 

# Match the Roman Numerals at the 
beginning of the 

# next line 

if (SNextUne — m/"([rVXUCDM}f» 

{ 35 
$RomanNumcral - $1; 

SYeaiTrans { $ RomanNumtra! } - $year, 

$DocTrans{£year} - SMajoxDoc; 
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Then, when the Dewey tag is written out, the Year is 
placed in the appropriate position, whether it be the first, 
second, or third level using the Roman Numeral from the 
currently matched pattern to obtain the correct Year from the 50 
hash table (YearTrans table specifically) as generated above. 



18 



While the systems and methods described herein consti- 
tute preferred embodiments of the invention, it is to be 
understood that the invention is not limited to these precise 
forms, and that changes may be made therein without 
departing from the scope of the invention. 

What is claimed is: 

1. A method for generating a searchable informational 
resource comprising the steps o£(a) dividing an informa- 
tional resource into a plurality of finite elements, the divid- 
ing step (a) being performed by an expert system according 
to a first rule set; 

(b) assigning a categorical tag to each of the plurality of 
finite elements, the categorical tag including data per- 
taining to a content of the finite element, the assigning 
step (b) being performed by an expert system according 
to a second rule set; 

(c) generating a searchable database record for each of the 
plurality of finite elements, each searchable database 
record including at least one string contained within the 
finite element; 

(d) supplying a search string; 

(e) searching the searchable database for searchable data- 
base records containing the search string so as to 
provide search results; 

(£) arranging the results of the searching step in a hierar- 
chal structure according to the information in the 
categorical tags assigned to the finite elements corre- 
sponding to the searchable database records found in 
the searching step; 

(g) displaying the results of the searching step in the 
hierarchal structures; 

(h) modifying at least one of the first and second rule sets, 
responsive to viewing, by an expert having expert 
knowledge regarding content and structure of the infor- 
mational resource, unsatisfactory results in the display- 
ing step (g); and 

(i) repeating steps (a) through (h) until the expert views 
satisfactory results in the displaying step (g). 

2. The method of claim 1, wherein the informational 
resource is a single document and the step of dividing the 
informational resource into a plurality of finite elements 
includes the step of identifying a plurality of sections in the 
single document, wherein each finite element is a single one 
of the plurality of sections. 

3. Hie method of claim 1, wherein the informational 
resource is a plurality of documents and the step of dividing 
the informational resource into a plurality of finite elements 
includes the step of identifying the plurality of documents, 
wherein each finite element is a single one of the plurality of 
documents, 



TABLE 2 



1) <meta name-** Dewey" contemV'Seventeenth Antarctic Consultative 

2) cmeofnarne^Dewey- content-"CONSERVAnON OF ANTARCTIC MARINE LIVING 
RESOURCES'^ % %Convention on the Conservation of 

Antarctfc% % 1>%%%——1ft> 1981 Article VI Relationship to existing conventions relating to 

the conservation of whales and Eeals**> 

3) <meta name-* 1 Dewey" content-"Resohitions and Measure! adopted at the XDCth Antarctic Treaty Consultative 
Mccting% 9l i 3 b%Anncx C%% %%Rc9ohilioas%% %%ReaohUion 8**> 

4>cmeta name-" Dewey" contentV'Seventeeaih Antarctic Consultative 
Meeting ^ ■ mm /J b % t Ml ■ %% %1 mm .■■%%% l c — Article Xir> 
5)<mela came-" Dewey" content-" 1980% % %%Convcntion on the Conservation of 
Antarctic ^ -'' —%%%% 1980 Article 
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4. The method of claim 1, wherein the informational 
resource is a plurality of documents and the step of dividing 
the informational resource into a plurality of finite elements 
includes the step of identifying the plurality of documents 
and identifying sections within the plurality of documents, 5 
wherein each finite element an identified section. 

5. The method of claim 4, wherein the identified section 
is an entire one of the plurality of documents. 

6. The method of claim 1, wherein the informational 
resource is a data stream and the step of dividing the l0 
informational resource into a plurality of finite elements 
includes the step of identifying segments within the data 
stream. 

7. The method of claim 1, wherein the categorical tag 
includes data pertaining to an organizational attribute of the lS 
finite element with respect to the informational resource. 

8. The method of claim 1, wherein the categorical tag 
includes data pertaining to the location of the finite element. 

9. The method of claim 1, wherein the categorical tag 
includes a categorical phrase pertaining to the content of the ^ 
finite element 

10. The method of claim 1, wherein the categorical tag 
includes a categorical number pertaining to the content of 
the finite element 

11. The method of claim 10, wherein the categorical ^ 
number is based upon a categorical referencing system 
analogous to the Dewey decimal system. 

12. The method of claim 1, wherein the step of dividing 
the informational resource into a plurality of finite elements 
includes the step of identifying and distinguishing between ^ 
headings and sub-headings within the informational 
resource. 

13. The method of claim 1, wherein each database record 
further includes an address or link to the corresponding finite 
element. 35 

14. The method of claim 1, wherein the database record 
includes non-common strings contained within the finite 
element. 

15. The method of claim 14, wherein: 

the informational resource is at least one document; 49 
the database record includes non-common words con- 
tained within the finite element; and 
the step of supplying a search string involves a step of 
assigning at least one search word. 

16. The method of claim IS wherein each database record 45 
further includes word frequency data pertaining to the 
frequency of the non-common words contained within the 
corresponding finite element. 

17. The method of claim 16 wherein a hierarchal structure 

is a collapsible/expandable hierarchy, having at least two 50 
levels and the arranging step (f) involves the step of ordering 
the results of the searching step according to a first one of the 
levels and then further ordering the results of the searching 
step according to a next one of the levels. 

18. The method of claim 17, wherein at least one of the 55 
levels of the hicrarchal structure is ordered based upon the 
data included in the categorical tags assigned to the finite 
elements that are associated with the database records found 

in the searching step (e). 

19. The method of claim 18, wherein another one of the 60 
two levels of the hierarchal is ordered based upon the word 
frequency data for database records found in the searching 
step (e). 

20. The method of claim 1 wherein a hierarchal structure 

is a collapsible/expandable hierarchy, having at least two 65 
levels and the arranging step (f) involves the step of ordering 
the results of the searching step according to a first one of the 



levels and then further ordering the results of the searching 
step according to a next one of the levels. 

21. The method of claim 20, wherein at least one of the 
levels of the hierarchal structure is ordered based upon the 
data included in the categorical tags assigned to the finite 
elements that are associated with the database records found 
in the searching step (e). 

22. The method of claim 1, wherein: 

the informational resource includes a document; 

the step of dividing the informational resource into a 
plurality of finite elements includes the step of identi- 
fying a plurality of sections in the document, each finite 
element being a single one of the plurality of sections 
the categorical tag includes information pertaining to a 
location of the finite clement with respect to other finite 
elements in the document; and 

the step of arranging the results of the searching step in a 
hierarchal structure according to the information in the 
categorical tags assigned to the finite elements corre- 
sponding to the searchable database records found in 
the searching step, includes the step of arranging the 
finite elements in an order in which the finite elements 
appear in the document. , , 

23. The method of claim 1, wherein step (b) includes the 
step of inserting each categorical tag into the respective one 
of the plurality of finite elements. 

24. The method of claim 1, wherein step (c) includes the 
step of inserting each categorical tag into the database record 
corresponding to the respective one of the finite elements. 

25. The method of claim 1, wherein the supplying step (d) 
includes the steps of: 

providing an initial search string; and 

creating at least one additional search string having a 

substantially similar meaning or effect of the initial 

search string. 

26. The method of claim 25, wherein the initial search 
string is a word in a first language and the at least one 
additional search string is the word in a second language. 

27. The method of claim 25, wherein the initial search 
string is a word having a first meaning and the at least one 
additional search string is another word having a second 
meaning substantially similar to the first meaning. 

28. The method of claim 25, wherein the at least one 
additional search string is based upon a known permeation 
of the initial search string. 

29. A method for retrieving information from an infor- 
mational resource comprising the steps of: 

breaking apart the informational resource into a plurality 

of discrete finite elements; 
creating a categorical tag for each of the plurality of 

discrete finite elements; 
generating a searchable database including a searchable 

database record for each of the discrete finite elements; 
searching the searchable database for relevant database 

records; 

associating the relevant database records with corre- 
sponding relevant, discrete finite elements; 

selecting a hierarchy for displaying identifying phrases 
pertaining to the relevant, discrete finite elements; 

ordering the relevant, discrete finite elements in the hier- 
archy according, at least in part, to the categorical tag 
created for each of the discrete finite elements; 

displaying the identifying phrases pertaining to the 
relevant, discrete finite elements according to the 
results of the ordering step; 
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selecting one of the identifying phrases; 
accessing the discrete finite element corresponding to the 

selected identifying phrase; and 
constructing a new informational resource for viewing 
using the accessed, relevant discrete finite element and 
other related, discrete finite elements. 
30. The method of claim 29, wherein the other related, 
discrete finite elements are determined based upon informa- 
tion contained within the categorical tag of the accessed, 10 
relevant discrete finite element. 
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31. The method of claim 29, wherein the other related, 
discrete finite elements include discrete finite elements cor- 
responding to database records found in the searching step. 

32. The method of claim 31, wherein the step of con- 
structing the new informational resource includes the step of 
ordering the other related, discrete finite elements and the 
accessed, relevant discrete finite elemeat according to infor- 
mation contained within the categorical tags of the other 
related, discrete finite elements and the accessed, relevant 
discrete finite element 

***** 
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